Everyone’s Voice Matters: Quantifying Annotation Disagreement Using Demographic Information
نویسندگان
چکیده
In NLP annotation, it is common to have multiple annotators label the text and then obtain ground truth labels based on major annotators’ agreement. However, are individuals with different backgrounds various voices. When annotation tasks become subjective, such as detecting politeness, offense, social norms, voices differ vary. Their diverse may represent true distribution of people’s opinions subjective matters. Therefore, crucial study disagreement from understand which content controversial annotators. our research, we extract five datasets, fine-tune language models predict disagreement. Our results show that knowing demographic information (e.g., gender, ethnicity, education level), in addition task text, helps To investigate effect demographics their level, simulate combinations artificial explore variance prediction distinguish inherent controversy perspective. Overall, propose an innovative mechanism for better design process will achieve more accurate inclusive systems. code dataset publicly available.
منابع مشابه
Detecting Missing Annotation Disagreement using Eye Gaze Information
This paper discusses the detection of missing annotation disagreements (MADs), in which an annotator misses annotating an annotation instance while her counterpart correctly annotates it. We employ annotator eye gaze as a clue for detecting this type of disagreement together with linguistic information. More precisely, we extract highly frequent gaze patterns from the pre-extracted gaze sequenc...
متن کاملQuantifying disagreement in argument-based reasoning
An argumentation framework can be seen as expressing, in an abstract way, the conflicting information of an underlying logical knowledge base. This conflicting information often allows for the presence of more than one possible reasonable position (extension/labelling) which one can take. A relevant question, therefore, is how much these positions differ from each other. In the current paper, w...
متن کاملCrowdsourcing Disagreement for Collecting Semantic Annotation
This paper proposes an approach to gathering semantic annotation, which rejects the notion that human interpretation can have a single ground truth, and is instead based on the observation that disagreement between annotators can signal ambiguity in the input text, as well as how the annotation task has been designed. The purpose of this research is to investigate whether disagreement-aware cro...
متن کاملDisagreement and Information Collection
This note shows that disagreement, in the sense of differing priors, may increase the incentives to collect information when two agents work on a joint project. The reason is that each agent believes that new data will confirm his own beliefs and thus ‘convince’ the other agents to do what the focal agent thinks is right.
متن کاملVoice quality assessment using phase information: Application on voice pathology
One of the most important human abilities is speech along with hearing. Speech is the primary way in which we attune to the society. Our voice can uncover several information about us to other people. It reveals our energy level, our emotions, our personality and our artistry. Voice abnormalities may cause social isolation or may create problems in the professional field. Due to this significan...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2023
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v37i12.26698